skip to main content


Search for: All records

Creators/Authors contains: "Li, Xia"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. The water discharge and sediment load have been increasingly altered by climate change and human activities in recent decades. For the Pearl River, however, long-term variations in the sediment regime, especially in the last decade, remain poorly known. Here we updated knowledge of the temporal trends in the sediment regime of the Pearl River at annual, seasonal and monthly time scales from the 1950s to 2020. Results show that the annual sediment load and suspended sediment concentration (SSC) exhibited drastically decreased, regardless of water discharge. Compared with previous studies, we also found that sediment load and SSC reached a conspicuous peak in the 1980s, and showed a significant decline starting in the 2000s and 1990s, respectively. In the last decade, however, water discharge and sediment load showed slightly increasing trends. At the seasonal scale, the wet-season water discharge displays a decreasing trend, while the dry-season water discharge is increasing. At the monthly scale, the flood seasons in the North and East Rivers typically occur one month earlier than that in the West River due to the different precipitation regimes. Precipitation was responsible for the long-term change of discharge, while human activities (e.g. dam construction and land use change) exerted different effects on the variations in sediment load among different periods. Changes in the sediment regime have exerted substantial influences on downstream channel morphology and saltwater intrusion in the Greater Bay Area. Our study proposes a watershed-based solution, and provides scientific guidelines for the sustainable development of the Greater Bay Area. 
    more » « less
  2. Deterministic compartmental models for infectious diseases give the mean behaviour of stochastic agent-based models. These models work well for counterfactual studies in which a fully mixed large-scale population is relevant. However, with finite size populations, chance variations may lead to significant departures from the mean. In real-life applications, finite size effects arise from the variance of individual realizations of an epidemic course about its fluid limit. In this article, we consider the classical stochastic Susceptible-Infected-Recovered (SIR) model, and derive a martingale formulation consisting of a deterministic and a stochastic component. The deterministic part coincides with the classical deterministic SIR model and we provide an upper bound for the stochastic part. Through analysis of the stochastic component depending on varying population size, we provide a theoretical explanation of finite size effects. Our theory is supported by quantitative and direct numerical simulations of theoretical infinitesimal variance. Case studies of coronavirus disease 2019 (COVID-19) transmission in smaller populations illustrate that the theory provides an envelope of possible outcomes that includes the field data.

     
    more » « less
  3. null (Ed.)
    API misuses are prevalent and extremely harmful. Despite various techniques have been proposed for API-misuse detection, it is not even clear how different types of API misuses distribute and whether existing techniques have covered all major types of API misuses. Therefore, in this paper, we conduct the first large-scale empirical study on API misuses based on 528,546 historical bug-fixing commits from GitHub (from 2011 to 2018). By leveraging a state-of-the-art fine-grained AST differencing tool, GumTree, we extract more than one million bug-fixing edit operations, 51.7% of which are API misuses. We further systematically classify API misuses into nine different categories according to the edit operations and context. We also extract various frequent API-misuse patterns based on the categories and corresponding operations, which can be complementary to existing API-misuse detection tools. Our study reveals various practical guidelines regarding the importance of different types of API misuses. Furthermore, based on our dataset, we perform a user study to manually analyze the usage constraints of 10 patterns to explore whether the mined patterns can guide the design of future API-misuse detection tools. Specifically, we find that 7,541 potential misuses still exist in latest Apache projects and 149 of them have been reported to developers. To date, 57 have already been confirmed and fixed (with 15 rejected misuses correspondingly). The results indicate the importance of studying historical API misuses and the promising future of employing our mined patterns for detecting unknown API misuses. 
    more » « less
  4. Coverage-based fault localization has been extensively studied in the literature due to its effectiveness and lightweightness for real-world systems. However, existing techniques often utilize coverage in an oversimplified way by abstracting detailed coverage into numbers of tests or boolean vectors, thus limiting their effectiveness in practice. In this work, we present a novel coverage-based fault localization technique, Grace, which fully utilizes detailed coverage information with graph-based representation learning. Our intuition is that coverage can be regarded as connective relationships between tests and program entities, which can be inherently and integrally represented by a graph structure: with tests and program entities as nodes, while with coverage and code structures as edges. Therefore, we first propose a novel graph-based representation to reserve all detailed coverage information and fine-grained code structures into one graph. Then we leverage Gated Graph Neural Network to learn valuable features from the graph-based coverage representation and rank program entities in a listwise way. Our evaluation on the widely used benchmark Defects4J (V1.2.0) shows that Grace significantly outperforms state-of-the-art coverage-based fault localization: Grace localizes 195 bugs within Top-1 whereas the best compared technique can at most localize 166 bugs within Top-1. We further investigate the impact of each Grace component and find that they all positively contribute to Grace. In addition, our results also demonstrate that Grace has learnt essential features from coverage, which are complementary to various information used in existing learning-based fault localization. Finally, we evaluate Grace in the cross-project prediction scenario on extra 226 bugs from Defects4J (V2.0.0), and find that Grace consistently outperforms state-of-the-art coverage-based techniques. 
    more » « less
  5. null (Ed.)
    Automated debugging techniques, including fault localization and program repair, have been studied for over a decade. However, the only existing connection between fault localization and program repair is that fault localization computes the potential buggy elements for program repair to patch. Recently, a pioneering work, ProFL, explored the idea of unified debugging to unify fault localization and program repair in the other direction for the first time to boost both areas. More specifically, ProFL utilizes the patch execution results from one state-of-the-art repair system, PraPR, to help improve state-of-the-art fault localization. In this way, ProFL not only improves fault localization for manual repair, but also extends the application scope of automated repair to all possible bugs (not only the small ratio of bugs that can be automatically fixed). However, ProFL only considers one APR system (i.e., PraPR), and it is not clear how other existing APR systems based on different designs contribute to unified debugging. In this work, we perform an extensive study of the unified-debugging approach on 16 state-of-the-art program repair systems for the first time. Our experimental results on the widely studied Defects4J benchmark suite reveal various practical guidelines for unified debugging, such as (1) nearly all the studied 16 repair systems can positively contribute to unified debugging despite their varying repairing capabilities, (2) repair systems targeting multi-edit patches can bring extraneous noise into unified debugging, (3) repair systems with more executed/plausible patches tend to perform better for unified debugging, and (4) unified debugging effectiveness does not rely on the availability of correct patches in automated repair. Based on our results, we further propose an advanced unified debugging technique, UniDebug++, which can localize over 20% more bugs within Top-1 positions than state-of-the-art unified debugging technique, ProFL. 
    more » « less
  6. null (Ed.)
  7. null (Ed.)
    Background: Research studies often rely on self-reported weight to calculate body mass index. The present study investigated how the accuracy of self-reported body weight in adolescent girls is affected by overweight/obesity, race/ethnicity, and mental health factors. Methods: In a cohort of girls who participated in the Trial of Activity for Adolescent Girls at ages 11 and 17 (n = 588), self-reported and measured weight were compared, and linear regression models were fitted to model the over- or underreporting. The Center for Epidemiological Studies-Depression Scale (CES-D) was used to calculate depressive symptom subscales for negative affect, anhedonia and somatic symptoms. Results: Allowing 3% difference between self-reported and measured weight for the correct reporting of body weight, 59.2% of girls reported their weight correctly, 30.3% underreported (−5.8 ± 4.8 kg), and 10.5% overreported (4.3 ± 3.5 kg). The average difference between self-reported and measured body weight was −1.5 ± 4.3 kg (p < 0.001). Factors for misreporting body weight were overweight (β ± SE − 2.60 ± 0.66%), obesity (β ± SE − 2.41 ± 0.71%), weight change between ages 11 and 17 (β ± SE − 0.35 ± 0.04% for each kg), height change between ages 11 and 17 (β ± SE 0.29 ± 0.10% for each cm), and negative affect (β ± SE − 0.18 ± 0.08% for each score unit). Conclusions: The difference between self-reported and measured body weight in adolescent girls is relatively small. However, the accuracy of self-reported body weight may be lower in girls with overweight or obesity, recent weight and height change, and higher negative affect. 
    more » « less